Privacy Preserving Data Mining in Electronic Health Record using K- anonymity and Decision Tree

نویسندگان

  • Anvita Srivastava
  • Gaurav Srivastava
چکیده

In this paper, we present an accurate and efficient privacy preserving data mining technique in Electronic Health Record (EHR) by using k –anonymity and decision tree C4.5 that is useful to generate pattern for medical research or any clinical trials. It is analyzed that anonymization offers better privacy rather than other privacy preserving method like that randomization, cryptography, perturbation and encryption methods. KAnonymity is useful for prevention of identity of patient in medical research; it is useful to avoid linking attack using suppression and generalization process. The objective of this research work is to propose and implement privacy preserving data mining approach, which best suits with EHR systems without impeding the flow of control. The data stored in EHR system is highly confidential which contains information about patient disease. To ensure confidentiality, we are using anonymization of identity revealing attribute before publishing it for other utility purpose. The experimental results show the validity of the proposed approach. Our approach is useful to preserve utility and privacy in healthcare datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attribute-based Access Control for Cloud-based Electronic Health Record (EHR) Systems

Electronic health record (EHR) system facilitates integrating patients' medical information and improves service productivity. However, user access to patient data in a privacy-preserving manner is still challenging problem. Many studies concerned with security and privacy in EHR systems. Rezaeibagha and Mu [1] have proposed a hybrid architecture for privacy-preserving accessing patient records...

متن کامل

k-Anonymous Decision Tree Induction

In this paper we explore an approach to privacy preserving data mining that relies on the k-anonymity model. The k-anonymity model guarantees that no private information in a table can be linked to a group of less than k individuals. We suggest extended definitions of k-anonymity that allow the k-anonymity of a data mining model to be determined. Using these definitions, we present decision tre...

متن کامل

A Review of Privacy Preserving Data Mining Using KADET

Privacy preserving data mining (PPDM) is being very important now-a-days where the data are easily available on the internet. Most of the fields use data mining techniques. PPDM allows users to have good knowledge from the large amount of data without disclosing the data. In this paper, we will give some overview about the existing PPDM techniques and how k-anonymity is used with decision tree ...

متن کامل

Privacy-Preserving Classification and Clustering Using Secure Multi-Party Computation

Nowadays, data mining and machine learning techniques are widely used in electronic applications in different areas such as e-government, e-health, e-business, and so on. One major and very crucial issue in these type of systems, which are normally distributed among two or more parties and are dealing with sensitive data, is preserving the privacy of individual’s sensitive information. Each par...

متن کامل

Hybrid Approach for Data Publishing Using Privacy Preservation Techniques

Now a day’s governmental and nongovernmental organization wants to share their information for the purpose of knowledge discovery. When the data’s are shared individuals personal data or sensitive data which should not be known to others. Eg. Medical record, voters list, census data. This may leads a latest research field called Privacy preserving data mining. To address these issues, released ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015